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Abstract 

Let 5 be a finite set of words such that Fact(S*) =^ E*. We deal with the problem of 
finding bounds on the minimal length of words in E* \ Fact(S*) in terms of the maximal 
length of words in S. 

1 Introduction 

A finite set S of (finite) words over an alphabet £ is said to be complete if Fact(S*), the set of 
factors of S*, is equal to £*, that is, if every word of E* is a factor of, or can be completed by 
multiplication on the left and on the right as, a word of S* . 

If S is not complete, E* \ Fact(S*) is not empty and a word in this set of minimal length is 
called a minimal uncompletable word (with respect to the non-complete set S). 

The problem of finding minimal uncompletable words and their length was introduced by 
Restivo [1], who conjectured that there is a quadratic upper bound for the length of a minimal 
uncompletable word for S in terms of the maximal length of words in S. 

A more general related question of deciding whether a given regular language L satisfies one 
of the properties E* = Fact(L), E* = Pref(L), E* = Suff{L) has been recently considered by 
Rampersad et al. in [3] , where the computational complexity of the aforesaid problems in case L 
is represented by a DFA or NFA is studied. In the particular case L = S* for S being a finite set 
of words - which is the case that is of interest for us - the authors mention that the complexity 
of deciding whether or not E* = Fact(S*) is still an open problem. 

In this note, we show by mean of an example that the length of a minimal uncompletable 
word for a set S whose longest word is of length k seems to grow as 3A: 2 asymptotically and at 
least gets larger than 2k 2 for effectively computed values, thus improving on a previous example 
given by Antonio Restivo 0], The computations of a minimal uncompletable word for the 
successive values of k in the parametrized example were made on the VAUCANSON platform for 
computing automata [5] . This result is briefly mentioned in [TJ . 

The previous attempts to studying non-complete sets of words lead us to the following 
formulation. 
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Let SCT,* and denote by 



uwl(S) 

and by 



min{|a;| : x £ E* \ Fact(S*)} if E* \ Fact(S*) / 0, 
otherwise 



UWL(k,a) = m&x{uwl(S) : S C E^ fc , |E| = cr} 



In fact we shall be interested by the case of binary alphabet, and we write UWL(k) — 
UWL(k, 2). The problem is to find upper and lower bounds for UWL(k). 

2 Bounds on the length of minimal uncompletable words 

Proposition 2.1. OV Let k be an integer and let S be a finite set of words whose maximal length 
is k and such that there exists a word u of length k with the property that no element of S is a 
factor of u. Then S is non-complete and the word 

w = (ua) k ~ 1 u with a G E 

is an uncompletable word for S. 

A direct consequence of this statement is then 

Corollary 2.2. OV For any integer k > 2 and any word u in E fc 7 the set S — E fc \ {u} is 
non- complete. 

Actually, if S = E fc \ {u} and u is an unbordered word, it can be proved that the uncom- 
pletable word from Proposition 12. II is also the shortest such word: 



Proposition 2.3. '21 For any integer k > 2 and any unbordered word u £ E fc , a shortest 
uncompletable word of S = E* \ {u} has length k 2 + k — 1. 

Corollary 2.4. For any k > 2 we have UWL(k) > k 2 + k — 1. 

Of course, if S contained in E* is non-complete and if SUT is also contained in E* and non- 
complete, any uncompletable word for S U T is uncompletable for S and uwl(S U T) > uwl(S). 

The "game" is thus to start from a set S of the form E* \ {u} and to find a subset T of 
words of length shorter than k such that SUT remains non-complete and the length of minimal 
uncompletable words increases as much as possible. This is the way that the bound k 2 + k — 1 
was already improved in [J: 

Example 1. Let k = 4 and let 

54 = E 4 \ {aabb} U {ab, ba, aba, baa, bab, bba] 

Then 

w = (aabb)aaa(aabb)baa(aabb)bbb(aabb) 
is a minimal uncompletable word for S4. 
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Note that in this example the shortest uncompletable word maintains the structure uv\uv2 ■ ■ ■ uv k ^\u 
of the uncompletable word from Proposition 12. 1[ but the intermediate words Vi this time have 
length k — 1. This example led Rcstivo to conjecture that UWL(k) < 2k 2 . More precisely: 

Conjecture. 0] If S is a non-complete set and k is the maximal length of words in S, there 
exists an uncompletable word of length at most 2k 2 . Moreover this word is of the form 
UV1UV2U ■ ■ ■ uvk-iu, where u is the suitable word of length k and V\, V2, • ■ ■ , 1 are words 
of length less than or equal to k. 

Example 2. Let k > 4 and let 

S k = S fc \ {a k ~ 2 bb} U £6a fe - 4 £ U E6a U J k 

where J k = Ui=i 3 (^ aI ^ U a l b). We computed that for 5 < k < 12 the word 

w = (a k ~ 2 bb)a k '\a k ~ 2 bb)ba k - 2 {{a k - 2 bb)b 2 a k ^) k -\a k - 2 bb) 

is a minimal uncompletable word for S k - Thus UWL(k) > 2k 2 — 2k + 1 for 5 < k < 12. Using 
a similar technique as in [2 , it can be proved that this word is uncompletable for each k > 5, 
but we are not aware whether this word is minimal uncompletable for k > 12. 

Unfortunately, it is not true in general that UWL(k) < 2k 2 . Indeed, we have 

Example 3. Let k > 6 and let 

S' k = S fe \ {a k - 2 bb} U E&a^E U S6a U b 4 U J k 
where J k = \J^(ba i T, U a l b). We computed that, for 7 < k < 12, 

w = (a fc - 2 66)a' £ - 1 (a fe - 2 66)6a fc - 4 ((a' £ - 2 66)6a(a fe - 2 66)66a' £ ~ 5 )' £ - 6 
(a fc - 2 66)a6(a' £ " 2 66)66a fe - 3 (a' £ - 2 66)6a' £ - 3 (a fe - 2 66) 

is a minimal uncompletable word for S' k . Thus UWL(k) > 3fc 2 — 9A; + 1 for 7 < < 12. 
The set is obtained from the set Sk considered in Example [5] by adding just the word b 4 . 

3 On the structure of minimal uncompletable words 

Let u be an unbordered word of length k, and S = S fe \ {u}. Any uncompletable word for S 
must contain the word u as a factor, and any word that contains an unbordered factor u can be 
uniquely written under the form 

W = V Q UViUV 2 ■ ■ ■ v m uv m+1 

with Vi £ E* \ E*uS*. 

Actually, we can say a little bit more on the structure of minimal uncompletable words. 
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Proposition 3.1. Let u be an unbordered word of length k and S = Y. k \ {u}. Then u is both a 
prefix and a suffix of any minimal uncompletable word for S , that is, any minimal uncompletable 
word for S is of the form 

W = UViUV 2 U ■ ■ ■ v m u 

with Vi G £* \ ZTitX*, m > 1. 

Proof. Let w be any minimal uncompletable word for S. Arguing by contradiction, suppose that 
u is not a suffix of w. Let w = w'x, with x G S. By the minimality of w, we have w' G Fact(S*), 
i.e. w' can be covered by words in S. Since S does not contain words longer than fc, there must 
exist a prefix p of w' such that p G Suff(S*) and |p| > |u>'| — fe, i.e. \p\ > \w\ — k. But then w 
could be written as w — pz, with \z\ < k and z =/= u. This implies that w could be covered by 
words of S, which is a contradiction. 

In an analogous way one can prove that u must be a prefix of w. 

□ 

Note that Proposition 13.11 still holds for non-complete sets of the form S = S fe \ {u} LIT, for 
u an unbordered word of length k and T a set of words of length shorter than k. 

What about the lengths of factors u,'s? In all the examples above each m has length shorter 
than k. Nevertheless, minimal uncompletable words for which this property is no longer true 
exist. 

Example 4. Let 

S 5 = E fe \ {a 3 bb} U E6aE U S6a U J 5 
where J5 = U^ =1 (6a l E U a l b), the set as in the Example [2] for k = 5. Then 

w = (a 3 bb)aaaa(a 3 bb)baa(a 3 bb)bbabaa(a 3 bb)baa(a 3 bb) 
is a minimal uncompletable word for 5 5 . 
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